which may additionally provide a basis for therapy.
Description

MOLECULAR MARKERS 

The present invention relates to markers associated with malignant tumours such as breast tumours but
especially colorectal tumours and metastases, as well as to the use of such markers in the diagnosis and
treatment of a range of neoplastic conditions and predispositions thereto. Such markers may, for example,
take the form of genomic DNA sequences, RNA sequences obtainable from said DNA sequences, cDNA
sequences obtainable from said RNA sequences or fragments of any such sequences, as well as polypeptides
defined by the coding portions of such sequences and antibodies obtainable by immunisation with any such
polypeptide or polypeptide fragment.

One of the major causes of failure in the treatment of colorectal cancer is the occurrence of metastatic
disease, involving primarily the liver and lungs, and occasionally bone. By the time of first presentation,
between 15 and 25% of patients have metastases to the liver (Welch and Donaldson, 1987), while it has been more
recently shown that about 30% of patients undergoing apparently curative resection for colorectal cancer
possess occult hepatic metastases (Finley and McArdle, 1982). Furthermore, in their study of occult
metastatic disease using computerised tomography (Finley and McArdle, 1982), it was shown that the
presence or absence of metastatic disease at the time of clinical presentation is the most critical prognostic
factor, accounting almost entirely for the observed pattern of mortality. The identification of those patients with
occult metastatic disease is clearly of considerable importance in planning therapy and, in addition, would
avoid the unnecessary further treatment of that group of patients surgically cured of their disease (Taylor et al,
1985).

The aggressiveness of colorectal tumours is currently assessed using the Dukes classification (Dukes,
1932). However, this, together with other prognostic indicators such as tumour morphology and serum levels
of carcinoembryonic antigen (CEA), does not reliably correlate with clinical outcome (Finley and McArdle, 1982;
Lewi et al, 1984). Measurement of DNA distribution patterns in tumour cell nuclei suggests that tumour ploidy
may be of prognostic value, non-diploid tumours tending to be more aggressive (Wolley et al, 1982). The few
clinically applicable markers that exist for colorectal cancer are of little specificity either for the diagnosis or for
monitoring the course of the disease (Schwartz, 1980), although it has been suggested that serum levels of
CEA (Tate, 1982) and alkaline phosphatase (Asbo et al, 1986) may be used as indicators of recurrence and of
secondary disease. However, these markers do not allow a distinction between recurrent local and metastatic
disease (Hine and Dykes, 1984).

Although many features of tumour cells have been studied in relation to metastasis (reviewed in Weiss,
1985), as yet no single variable has been consistently identified as being associated with the metastatic
phenotype and there is no clinically reliable means of predicting the metastatic potential of a tumour.
Metastasis is considered to be a multistep process (reviewed in Hart and Fidler, 1980; Nicolson, 1982;
Schirrmacher, 1985) involving many phenotypic characteristics expressed as a consequence of the activity of
many gene loci, and would be expected to be reflected in changes in the relative abundances of specific
mRNAs.

Variations in abundance of individual mRNAs can be studied by the application of molecular cloning
techniques which allow the identification of previously uncharacterised genes that are associated with a
particular cell phenotype. Cloned complementary DNAs (cDNAs) representing specific abundant mRNAs have
been used, for example, to identify sequences associated with normal development (Sim et al, 1979) and
transformation (Augenlicht and Kobrin, 1982), and to identify further markers to supplement those used in
classification of leukaemias (Weidemann et al, 1983; Warnock et al, 1985; Mars et al, 1985).

The present invention is based at least in part on the discovery of nucleotide sequences and proteins that
are differentially expressed during malignant tumour progression and metastasis, for example in breast cancer
but especially in colorectal cancer, and which may thus serve inter alia as general molecular markers in primary
and metastatic neoplastic disease, for example in breast cancer and especially in colorectal cancer, as well as
providing a basis for therapy. The present invention also relates to methods of detecting such markers.
Thus according to one feature of the present invention there is provided a polynucleotide sequence which is
differentially expressed during malignant tumour progression and metastasis in colorectal cancer and
fragments thereof.

Such polynucleotide sequences will in general be in isolated or cloned form and will include genomic DNA
sequences and corresponding RNA sequences as well as cDNA sequences derived from the aforementioned
RNA sequences and any fragment of such sequences. Such cDNA sequences may for example be obtained
from RNA by the use of reverse transcriptase.
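
By way of illustration only, the sequence relationship between an RNA and the first-strand cDNA obtained from it by reverse transcription can be sketched as a reverse complement. The function name and the six-base toy sequence below are hypothetical and are not taken from the present specification.

```python
# Illustrative sketch: a first-strand cDNA is the DNA reverse complement
# of its RNA template. Names and the toy sequence are hypothetical.
RNA_TO_DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def first_strand_cdna(rna: str) -> str:
    """Return the first-strand cDNA (written 5'->3') for an RNA written 5'->3'."""
    rna = rna.upper()
    unknown = set(rna) - set(RNA_TO_DNA_COMPLEMENT)
    if unknown:
        raise ValueError(f"unexpected RNA bases: {sorted(unknown)}")
    # Read the template backwards and pair each base with its DNA complement.
    return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in reversed(rna))


print(first_strand_cdna("AUGGCU"))  # AGCCAT
```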

Such genomic DNA sequences and corresponding RNA sequences are preferably in substantially pure 
form. 

The fragments of the aforementioned polynucleotide sequences and cDNA sequences will in general
consist of at least 8 consecutive nucleotides, preferably at least 10, more preferably at least 12, nucleotides of
the polynucleotide sequence.

The polynucleotide sequences of the present invention are preferably characterised in that the cDNA 
sequences corresponding thereto contain the following sequence: 
5'-AGCGTGGGTA TCGAGGCGGA CGACGACCGG CTCAACAAGG TTATCAGTGA GCTGAATGGA
   AAAAACATTG AAGACGTCAT TGCCCAGGGT ATTGGCAAGC TTGCCAGTGT ACCTGCTGGT
   GGGGCTGTAG CCGTCTCTGC TGCCCCAGGC TCTGCAGCCC CTGCTGCTG GTTCTGCCCC        (I)
   TGCTGCAGCA GAGGAGAAGA CAGATGAGAA GAAGGAGGAG TCTGAAGAGT CAGATGATGA
   CATGGGGATT TGCCTTTTTG ATTAAATTCC TGCTCCCCTG CAATAACCTT TTTACACATC
   TTA-3'

It will be appreciated that the fragment of formula I of the present invention may be used to identify mRNA 
species corresponding thereto in Northern blot analyses. 

The polynucleotide sequences of the present invention may also be characterised in that a cDNA sequence
derived from said polynucleotide sequences comprises the sequence insert in pLM59 (NCIB No. 12429).

pLM59 has been deposited under the Budapest Treaty with The National Collections of Industrial & Marine
Bacteria Ltd, Torry Research Station, PO Box No. 31, 135 Abbey Road, Aberdeen, AB9 8DG, Scotland under
the deposition number NCIB 12429. The deposition date in respect of pLM59 is 19th March 1987. pLM59
consists of transformed E.coli JM83 (ATCC No. 35607); see Gene 19, p259-268, 1982 and 25, p241-247, 1983.
The insert sequence in pLM59 may be defined by EcoRI and BamHI restriction sites at the terminal ends and by the
length of the fragment (see Table 2).

It will be appreciated that the insert sequence in pLM59 or the polynucleotide sequence of formula I
hereinbefore defined or a fragment of such sequences having at least 8, preferably at least 10, more preferably
at least 12, especially at least 14 consecutive nucleotides may be used to obtain the corresponding genomic
DNA or RNA sequences of humans and animals after such sequences have been cloned in appropriate
vectors using standard techniques known in the art (see for example T Maniatis et al; Molecular Cloning A
Laboratory Manual, Cold Spring Harbor Laboratory, 1982). Polynucleotide sequences of the present invention
may also be prepared by direct chemical or enzymatic synthesis or microbiological reproduction.

It will also be appreciated that polynucleotide sequences of the present invention define the sequences of 
polypeptides which are encoded therein. The expression of such polypeptides may itself constitute a useful
marker in the investigation of malignant disease. Such polypeptides may have some biological role in the 
development of malignant disease and interference with this function may be useful in therapy of malignant 
disease. 

It will be appreciated that the aforementioned molecular markers may be determined in a number of different 
ways. Thus for example polynucleotide probes may be constructed which are capable of hybridisation to any 
portion of the genomic DNA precursor of the aforesaid RNA sequence including introns and non-coding as 
well as coding portions of the DNA sequence. 

Polynucleotide probes may also if desired be constructed which are capable of hybridisation to any portion
of the aforesaid RNA sequence, regardless of whether the portion is capable of translation into a polypeptide
or not. Moreover, if desired the molecular marker in the form of an RNA sequence may be transcribed into a
corresponding cDNA sequence using for example reverse transcriptase and the molecular marker determined
by the use of a polynucleotide probe capable of hybridising to any portion of the cDNA sequence. It will be
appreciated that the polynucleotide probe will comprise a nucleotide sequence capable of hybridisation to
a sufficient length of the sequence to be determined to ensure that the probe unambiguously detects the
sequence of interest. In general the probe will be capable of hybridising to at least 8 consecutive nucleotides
of the sequence to be determined, preferably to at least 10 consecutive nucleotides, more preferably to at
least 12 consecutive nucleotides and especially to at least 14 consecutive nucleotides.
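
The length criterion stated above can be checked computationally. The following is a minimal sketch, assuming perfect Watson-Crick pairing only and ignoring hybridisation conditions; the helper names are hypothetical, the target is simply the first 20 nucleotides of formula I, and the probe is derived from it for the purpose of the example.

```python
# Sketch: does a probe pair with at least `minimum` consecutive nucleotides
# of a target? Perfect base-pairing only; helper names are hypothetical.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def longest_common_run(a, b):
    """Length of the longest stretch of consecutive identical bases in a and b."""
    best = 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                curr[j] = prev[j - 1] + 1  # extend the run ending at (i-1, j-1)
                best = max(best, curr[j])
        prev = curr
    return best


def probe_meets_minimum(probe, target, minimum=8):
    # A probe pairs with the target wherever its reverse complement matches it.
    return longest_common_run(reverse_complement(probe), target.upper()) >= minimum


TARGET = "AGCGTGGGTATCGAGGCGGA"            # first 20 nt of formula I
PROBE = reverse_complement(TARGET[3:15])   # 12-mer complementary to nt 4-15

print(PROBE, probe_meets_minimum(PROBE, TARGET, minimum=12))
```

The dynamic-programming table is chosen for clarity rather than speed; for sequences of the length of formula I it is more than adequate.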

Thus according to one feature of the present invention there is provided a polynucleotide probe which
comprises a nucleotide sequence capable of hybridising to a polynucleotide of the present invention or portion
thereof, said probe optionally having a labelled or marker component.

As stated above the polynucleotide probes of the present invention will in general be capable of hybridising
to at least 8 consecutive nucleotides of the polynucleotides of the present invention, preferably at least 10
consecutive nucleotides, more preferably to at least 12 consecutive nucleotides and especially to at least 14
consecutive nucleotides.

The polynucleotide probes of the present invention may be labelled or marked according to techniques
known in the art, for example 32P-radiolabelled in any conventional way, or alternatively radiolabelled by other
means well known in the hybridisation art, for example to give 35S-radiolabelled probes. The probes may for
example carry fluorescent markers. They may alternatively be labelled with biotin or a similar species by the
method of D C Ward et al, as described in Proceedings of the 1981 ICN-UCLA Symposium on Developmental
Biology using Purified Genes held in Keystone, Colorado on March 15-20, 1981, Vol. XXIII, 1981, pages 647-658,
Academic Press; Editor Donald D Brown et al, or even enzyme-labelled by the method of A D B Malcolm et al,
Abstracts of the 604th Biochemical Society Meeting, Cambridge, England (meeting of 1 July 1983).

The aforementioned molecular markers may also be determined by the use of antibodies, which may be 
polyclonal but are preferably monoclonal, raised to a polypeptide sequence coded for by at least a portion of 
the aforementioned genomic DNA sequence or corresponding RNA sequence. The antibody may thus bind to 
the protein encoded by the aforementioned genomic DNA sequence or corresponding RNA sequences or 
bind to any fragment of the protein.

Thus according to a further feature of the present invention there is provided an antibody effective to bind at
least a fragment of the polypeptide encoded by the polynucleotide of the present invention. The term
'antibody' as used herein includes [...]
for antigenic determinants of polypeptides of the present invention.

The antibody of the present invention may if desired carry a label or marker component, for example as
hereinbefore described in relation to the polynucleotide probes of the present invention. Thus the antibodies
may for example carry a fluorescent marker. It is not however necessary that the antibody of the present
invention carry a label or marker component. Thus for example the antibody of the present invention may be
detected by a second antibody which is an antibody to antibodies of the species of the antibodies of the
present invention, for example goat antimouse immunoglobulin. The second antibody will have a labelled or
marker component.

The polynucleotides, polynucleotide probes, polypeptides and antibodies of the present invention may find
use in the following areas:-

1) Serological diagnosis - for example testing patients, e.g. those predisposed to malignancies, for the
presence of the protein (encoded by the polynucleotides of the present invention) or antibodies thereto in
blood, urine or other body fluids, tissue or excretion products;

2) Immunohistochemistry applications - for the diagnosis of malignant disease in tissue samples;

3) Diagnostic imaging - in which case the antibody or probe will have an appropriate label or marker, for
example a radioactive label or marker;

4) Therapy - a) for example antibodies of the present invention may form part of an immunotoxin,
sometimes termed the "magic bullet", in order to deliver toxic agents or drugs such as plant toxins, e.g.
ricin, preferentially to the site of a malignant or even benign tumour (see for example European Patent
Application No. 84304801.8 - Publication No. 0145111);

b) for example the antibodies of the present invention may be useful as a therapeutic;

c) for example polynucleotides of the present invention may be useful alone in therapy as antisense
DNA or RNA. Thus polynucleotides of the present invention, optionally in a vector or in a polynucleotide
analogue, which contain sequences complementary to DNA or RNA defining a protein which is
differentially expressed during cancer progression and metastasis or portion thereof may be employed to
prevent expression of the said protein;

5) Histological analysis - polynucleotide probes (DNA or RNA) having an appropriate label or marker
may be useful in in situ hybridisation for histological analysis;

6) Determination of predisposition to genetic disease - for example the polynucleotide of the present
invention (DNA or RNA) may be useful in the analysis of restriction enzyme fragment length or other
polymorphisms associated with a predisposition to malignant disease. Furthermore, detection of
mutations within the polynucleotide sequences of the present invention in individuals may correlate with a
predisposition to malignant disease.
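
The principle behind the restriction fragment length analysis mentioned in area 6) may be sketched as follows. This is an illustration only: the two "alleles" are invented toy sequences, the function name is hypothetical, and the only biological fact relied upon is that EcoRI recognises GAATTC and cuts after the first G.

```python
# Sketch of a restriction fragment length polymorphism on toy sequences.
# Allele sequences are invented; EcoRI recognises GAATTC and cuts G^AATTC.
def digest(seq, site="GAATTC", cut_offset=1):
    """Fragment lengths from cutting a linear sequence at every occurrence of site."""
    seq = seq.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [right - left for left, right in zip(bounds, bounds[1:])]


allele_a = "ACGT" * 3 + "GAATTC" + "TTGCA" * 2   # carries one EcoRI site
allele_b = "ACGT" * 3 + "GAGTTC" + "TTGCA" * 2   # point mutation abolishes the site

print(digest(allele_a))  # [13, 15]
print(digest(allele_b))  # [28]
```

A single base change thus converts two fragments into one, which is the kind of length difference a labelled probe would reveal on a Southern blot.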

Whilst the nucleotide sequences of the present invention were originally identified as being differentially
expressed during malignant tumour progression and metastases in colorectal cancer, the nucleotide
sequences have been found to be additionally associated with malignant breast disease and metastases. The
polynucleotide sequences of the present invention and fragments thereof, the polypeptides of the present
invention and fragments thereof and the antibodies of the present invention are thus of interest as general
markers in primary and metastatic neoplastic disease and not only in relation to malignant breast disease and
colorectal cancer and metastases associated therewith.

Brief description of the drawings

Figure 1 shows Northern blot analyses of normal mucosa RNA. Total RNA (10 µg per lane) from a
sample of normal colonic mucosa was electrophoretically fractionated on a 1% agarose-formaldehyde
gel and transferred to nitrocellulose. Individual lanes were hybridised with labelled recombinant plasmid
probes pNM19, pNM32, pNM41, pNM61 and pLM59 as indicated (19, 32, 41, 61, 59 respectively).

Figure 2 shows Northern blot analyses of RNAs from mucosae, primary colon tumours and liver
metastases. Total RNA (10 µg per lane) was electrophoretically fractionated on a 1% agarose-formaldehyde
gel, transferred to nitrocellulose and hybridised with 32P-labelled recombinant pLM59 DNA. RNAs
were from: lanes 1-4, normal colonic mucosae; lanes 5-7, primary tumours; lanes 8 and 9, liver
metastases; lane 10, normal human liver. RNAs in lanes 1, 2 and 7 were prepared from tissue samples
obtained from patients with confirmed metastatic disease. RNAs in lanes 3-6 were prepared from tissue
samples obtained from patients with no evidence of secondary disease at the time of surgery.

Figure 3 shows the relative abundance of pNM32 RNA at different stages in colorectal tumour
progression. The relative abundance of RNA homologous to recombinant plasmid pNM32 in tissue
specimens representing different stages of colorectal tumour progression was determined by
doubling-dilution RNA dot-blot hybridisation to 32P-labelled plasmid DNA. Total RNAs at a concentration
of 500 µg/ml were diluted and applied to nitrocellulose as described (see Materials and Methods) in a
volume of 4 µl, the first dot in each series thus representing 2 µg of total RNA.
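
The arithmetic behind the doubling-dilution series of Figure 3 can be made explicit as follows. The concentration and volume are those stated above; the number of dots per series is not given in the text, so the value of six used here is an assumption, as is the function name.

```python
# Sketch: total RNA per dot in a doubling-dilution dot-blot series.
# 500 ug/ml and 4 ul come from the text; n_dots=6 is an assumed value.
def dot_blot_amounts_ug(conc_ug_per_ml=500.0, volume_ul=4.0, n_dots=6):
    first_dot = conc_ug_per_ml * volume_ul / 1000.0  # ug/ml x ml
    return [first_dot / 2 ** i for i in range(n_dots)]


print(dot_blot_amounts_ug())  # [2.0, 1.0, 0.5, 0.25, 0.125, 0.0625]
```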
It may be advantageous to present the polynucleotide probes and/or antibodies of the present invention in
the form of diagnostic kits, and kits are regarded as further features of the present invention.

Thus according to a further feature of the present invention there is provided a kit for detecting
polynucleotides of the present invention which comprises a polynucleotide probe as hereinbefore defined,
optionally in labelled or marked form. The kit may additionally contain means for labelling or marking the probes
either prior to or subsequent to hybridisation, where such probes are not already labelled or marked. The kit
may also contain means for detecting said label or marker. If desired the kit may contain enzymes such as DNA
polymerase or enzymes for introducing appropriately labelled nucleotides into DNA or RNA probes. The kit
may contain restriction endonucleases and other appropriate materials for performing analyses of RFLPs.

According to a further feature of the present invention there is provided a kit for detecting the polypeptide of
the present invention or fragments thereof which comprises an antibody of the present invention as
hereinbefore defined, optionally in labelled or marked form. Where the antibody is not in labelled or marked
form the kit may contain a second antibody which is an antibody to antibodies of the same species as the
unlabelled antibody as hereinbefore described. The second antibody will be labelled or marked and the kit may
include a format appropriate for effecting the determination, for example as described in US Patent
No. 4,376,110. The kit may also contain apparatus for the preparation of histological samples for analysis using
antibodies of the present invention.

If desired the polypeptide of the present invention or fragments thereof may be useful as standards in 
analysis of samples by physical techniques, for example HPLC, TLC or other chromatographic and/or
spectroscopic techniques. 

According to a further feature of the present invention there is provided a kit for manufacturing the
polynucleotide of the present invention which kit comprises microorganisms containing vectors capable of
producing the polynucleotides of the present invention.

For diagnostic imaging the polynucleotide probe or antibody of the present invention will have an
appropriate labelled or marker component, for example a radioactive label or marker, and will conveniently be
presented in a form suitable for ingestion or injection.

The antibodies of the present invention may also be of interest in purifying a polypeptide of the present 
invention and accordingly we further provide a method of purifying a polypeptide of the present invention as 
hereinbefore defined or any portion thereof or a metabolite or degradation product thereof which method 
comprises the use of an antibody of the present invention. 

The purification method of the present invention may be effected by any convenient technique known in the
art, for example by providing the antibody on a support and contacting the antibody with a solution containing
the polypeptide whereby the antibody binds to the polypeptide of the present invention. The polypeptide may
be released from binding with the antibody by known methods, for example by changing the ionic strength of
the solution in contact with the complex of the polypeptide/antibody.

The following non-limiting Example is provided in order to illustrate the present invention:-

EXAMPLE 1 

MATERIALS AND METHODS 
Tissues 

Specimens of histologically confirmed adenomatous polyps, colorectal tumours, and liver metastases from
colorectal tumours were obtained from patients undergoing surgery at Glasgow Royal Infirmary. Specimens of
histologically normal colonic mucosae were obtained from tissue adjacent to the resection margins of
surgically removed colorectal tumours. All tissues were immediately frozen in liquid nitrogen and stored at
-70°C until required.

Isolation of total RNA

Total RNA was isolated from the frozen tissue specimens by a modification of the method of Chirgwin et al
(1979), which yields undegraded total RNA suitable for the isolation of poly(A)+ RNA, overcoming the high level
of activity associated with endogenous RNAases in these tissues. A sample (about 0.51 g) of the tissue
specimen was ground to a fine powder under liquid nitrogen in a pre-cooled porcelain mortar and pestle. The
ground tissue was lysed by transfer to 20 ml of guanidinium thiocyanate solution (5 M guanidinium thiocyanate,
5% mercaptoethanol, 50 mM tris-HCl, 50 mM EDTA, pH 7.0). DNA was fragmented by sonication, 1/10 vol 20%
sarcosine was added and the solution warmed to 55°C in a water bath for two minutes. Gross tissue debris was
removed by centrifugation at 1000 rev/min for 10 minutes in an MSE 4L centrifuge. The solution was layered
over a cushion of CsCl (5.7 M CsCl, 50 mM EDTA, pH 7.0; refractive index 1.3995) and centrifuged at 22,000
rev/min (60 000 g av) at 17°C for 48 hours in an IEC SB-110 rotor.

The pellets were resuspended in sterile water and precipitated by adding 1/10 vol 3 M sodium acetate and 3
vol absolute ethanol. The solution was kept at -20°C overnight, and the precipitated material recovered by
centrifugation at 10,000 rev/min (8700 g av) at 4°C for 20 minutes in a Sorvall HB4 rotor. The pellets were
washed in 70% and 95% ethanol and finally resuspended in sterile water at a concentration of approx. 1 [...]

Poly(A)+ RNAs were isolated from total RNAs by the method of Aviv and Leder (1972) using oligo(dT)-cellulose
(BRL), recovered by precipitation and washed as described above, and finally resuspended in sterile
water at a concentration of 250 µg/ml and stored at -20°C.
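
The pairs of rotor speed and average centrifugal force quoted above (22,000 rev/min with 60 000 g, and 10,000 rev/min with 8700 g) can be related through the standard expression RCF = 1.118 x 10^-5 x r x N^2, where r is the radius in cm and N the speed in rev/min. The sketch below back-calculates the average radius implied by each quoted pair; the resulting radii are inferences from the quoted figures, not rotor specifications, and the function names are hypothetical.

```python
# Sketch: standard relation between rotor speed and relative centrifugal force.
# RCF = 1.118e-5 * radius_cm * (rev/min)^2. Function names are hypothetical.
RCF_CONSTANT = 1.118e-5


def rcf(rev_per_min, radius_cm):
    """Relative centrifugal force (multiples of g) at a given radius."""
    return RCF_CONSTANT * radius_cm * rev_per_min ** 2


def implied_radius_cm(rev_per_min, g_force):
    """Average radius (cm) implied by a quoted speed and g-force."""
    return g_force / (RCF_CONSTANT * rev_per_min ** 2)


for speed, force in [(22_000, 60_000), (10_000, 8_700)]:
    print(speed, force, round(implied_radius_cm(speed, force), 1))
# implied average radii are roughly 11.1 cm and 7.8 cm respectively
```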

cDNA library construction

Double-stranded cDNAs were synthesised from poly(A)+ RNAs by the method of Wickens et al (1978).
Oligo(dT)-primed poly(A)+ RNA was reverse transcribed by AMV reverse transcriptase (Bio-Rad Laboratories)
to generate a first strand with a hairpin loop which was used to prime second strand synthesis by E.coli DNA
polymerase I (Boehringer). The hairpin loop was removed by digestion with S1 nuclease and the resultant
cDNA was blunt-end ligated into the SmaI site of plasmid pUC8. The recombinant plasmids were used to
transform E.coli JM83 and individual recombinant clones were grown on L-agar 9 cm plates. Individual colonies
were picked, inoculated, and grown in 96-well microtitre plates (Flow Laboratories), duplicated and stored at
-20°C. Simultaneously, using a transfer plate (Dynatech), two nylon filter (Biodyne A, PALL) replicas of each
plate were copied, and the bacterial DNA lysed and baked onto the filters for screening. The insert is recovered
by digestion of the recombinant plasmid with EcoRI and BamHI by methods known in the art.

cDNA probe preparation and colony hybridisation

All of the probes used to screen the libraries were single-stranded cDNAs synthesised from poly(A)+ RNAs
using AMV reverse transcriptase (Bio-Rad Laboratories) and 32P-dCTP (α-32P-dCTP, 400 Ci/mmol, Amersham
International plc) as label. Colony hybridisation (Grunstein and Hogness, 1975) to the nylon filter replicas of the
cDNA libraries was carried out as described by the manufacturer (PALL) at 65°C for at least 12 hours using a
probe concentration of 0.5-1 x 10^6 cpm/ml. Excess probe was removed by three half-hour washes in a washing
buffer (5 mM NaH2PO4, 1 mM EDTA, 0.2% SDS) at 65°C. Colony hybridisation was visualised by
autoradiography at -70°C using Kodak X-Omat film and Dupont Lightning Plus intensifying screens.

Plasmid DNA isolation and dot-blot hybridisation

Small-scale bacterial cultures (2 ml overnight cultures) were used for the isolation of plasmid DNA by the
method of Birnboim and Doly (1979). The DNAs were dot blotted onto Biodyne A nylon membrane filters,
denatured and baked as described by the manufacturers (PALL) prior to hybridisation under conditions as
described for colony hybridisation. For further study plasmids were isolated from 500 ml overnight cultures,
using the alkaline lysis method of Birnboim and Doly (1979). The plasmids were purified by CsCl and sucrose
gradient centrifugation. Recombinant plasmids were finally resuspended in TE buffer (10 mM tris-HCl, 1 mM
EDTA, pH 8.0) at a concentration of 250 µg/ml and stored at 4°C.

Northern blot analysis and dot-blot analysis of total RNA

Total and poly(A)+ RNAs in a buffer solution containing 50% formamide and 2.2 M formaldehyde were
heated to 65°C for 10 minutes, chilled on ice, and electrophoretically fractionated on 1% agarose-formaldehyde
gels prior to Northern blotting onto nitrocellulose as described by Thomas (1980).

Serial doubling dilutions of total RNAs in sterile water were heated to 65°C for 15 minutes and chilled on ice
before dot blotting onto nitrocellulose that had been previously wetted in 20X SSC (3 M NaCl, 0.3 M sodium
citrate, pH 7.0) and air dried. The RNAs were immobilised onto the nitrocellulose by baking for 2 hours at 80°C.






Southern blot analysis 

Restriction enzyme digested normal human white blood cell DNA, 18 µg per lane, was electrophoretically
fractionated overnight on 1% agarose gels, and then transferred to nitrocellulose using a modification of the
method of Southern (1975).



Hybridisation conditions

Recombinant plasmids were radioactively labelled by nick-translation using 32P-dCTP (α-32P-dCTP, 400
Ci/mmol, Amersham International plc). Nitrocellulose filters were pre-hybridised in a buffer containing 50%
formamide, 0.1% SDS, 5X Denhardt's (0.1% Ficoll 400K MW, 0.1% polyvinyl pyrrolidone 360K MW, 0.1% bovine
serum albumin), 5X SSC, 50 mM sodium phosphate, 500 µg/ml salmon sperm DNA, 10 µg/ml each of poly(A)
and poly(C), 1% glycine, pH 7.0, for at least 12 hours at 42°C. Hybridisations were carried out in a buffer
containing 50% formamide, 10% dextran sulphate, 0.1% SDS, 5X SSC, 1X Denhardt's, 20 mM sodium
phosphate, 100 µg/ml each of poly(A) and poly(C), pH 7.0, for at least 12 hours at 42°C with a probe
concentration of 0.5-1 x 10^6 cpm/ml. Following hybridisations filters were washed at 65°C in 2X SSC, 0.1%
SDS, then 0.5X SSC, 0.1% SDS, and finally 0.1X SSC, 0.1% SDS, and exposed to Kodak X-Omat film with
intensifying screens at -70°C.
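
For preparing a hybridisation buffer of the stated final composition from concentrated stocks, the usual C1V1 = C2V2 relation applies. The sketch below is illustrative only: the stock concentrations (100% formamide, 50% dextran sulphate, 20% SDS, 20X SSC, 50X Denhardt's, 1 M sodium phosphate) are assumptions not given in the text, the function name is hypothetical, and poly(A) and poly(C) are omitted for brevity.

```python
# Sketch: stock volumes for the hybridisation buffer via C1*V1 = C2*V2.
# FINAL values are from the text; STOCK values are ASSUMED for illustration.
FINAL = {
    "formamide (%)": 50, "dextran sulphate (%)": 10, "SDS (%)": 0.1,
    "SSC (X)": 5, "Denhardt's (X)": 1, "sodium phosphate (mM)": 20,
}
STOCK = {
    "formamide (%)": 100, "dextran sulphate (%)": 50, "SDS (%)": 20,
    "SSC (X)": 20, "Denhardt's (X)": 50, "sodium phosphate (mM)": 1000,
}


def stock_volumes_ml(total_ml, final=FINAL, stock=STOCK):
    """Volume of each stock (ml) needed for total_ml of buffer, plus water."""
    volumes = {name: total_ml * final[name] / stock[name] for name in final}
    water = total_ml - sum(volumes.values())
    if water < -1e-9:
        # The stocks alone already exceed the target volume.
        raise ValueError("stocks are too dilute for this final composition")
    volumes["water"] = max(water, 0.0)
    return volumes


for name, ml in stock_volumes_ml(10.0).items():
    print(f"{name:24s}{ml:6.2f} ml")
```

With these assumed stocks the components account for almost the whole volume, which illustrates why concentrated stocks are needed for a buffer containing 50% formamide and 10% dextran sulphate; the function raises an error when the chosen stocks cannot reach the final composition.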
RESULTS 

Screening of cDNA libraries

A cDNA library of approx. 5000 clones representative of normal colonic mucosa poly(A)+ RNAs was
screened with probes generated from poly(A)+ RNAs according to the scheme outlined in the screening
protocol set out below.
Screening protocol for cDNA libraries: identification of
recombinants associated with tumour stage

                            Normal colonic mucosa            Liver metastasis
                            cDNA library                     cDNA library
                            about 5000 clones                about 3000 clones
                                  |                                |
Colony hybridisation 1      probes(a): normal mucosae(4)(b)  probes: liver metastases(2)
                                                                     colorectal ca(4)
                                  |                                |
                            912 clones                       288 clones
                                  |                                |
Colony hybridisation 2      probes: normal mucosae(2)        probes: normal mucosae(3)
                                    colorectal ca(5)                 colorectal ca(3)
                                                                     liver metastases(3)
                                                                     normal liver(1)
                                  |                                |
                            89 clones                        82 clones
                                  |                                |
Plasmid DNA dot-blot        probes: normal mucosae(3)        probes: normal mucosae(3)
hybridisation                       colorectal ca(3)                 colorectal ca(3)
                                    liver metastases(3)              liver metastases(3)
                                                                     normal liver(1)

a. All probes were 32P-dCTP-labelled cDNAs reverse transcribed from poly(A)+ RNAs.
b. Figures in brackets indicate the number of different tissue specimens used to generate cDNA
probes at each stage of screening.

Initially, in order to accommodate any inter-patient variation in gene expression, the library was screened
with cDNA probes transcribed from RNAs from four different normal mucosae. On this basis 912 recombinant
clones representing poly(A)+ RNA sequences of high and medium abundance classes were identified as being
common to the mucosae RNAs. Further screening of these recombinants, firstly with cDNA probes derived
from two normal mucosae specimens, to establish a base-line relative hybridisation pattern, and secondly with
cDNA probes derived from five different colonic tumours, identified 89 recombinants representing abundant
sequences in normal colonic mucosae which were of significantly altered abundance in colorectal tumours on
the basis of differences in the intensities of the autoradiographic signals.

Since the results of Grunstein-Hogness colony screening depend not only on the degree of specific
hybridisation with the probes used but also on a number of variables, such as growth of a particular clone in
the microtitre plate, the reproducibility of transfer and growth of bacterial colonies on the nylon filters, and
recombinant plasmid copy number, plasmid DNA was isolated as described from each of the 89 recombinants
and dot blotted in duplicate onto each of four replica nylon filters. Hybridisation of plasmid DNA dot blots with
cDNA probes generated from normal colonic mucosae, colonic tumours, and liver metastases of colonic
tumours was carried out sequentially such that no individual filter was re-hybridised with an identical class of
probe, each filter was hybridised with each of the aforementioned classes of probe, and three different tissue
specimens from each histological tissue type were used.

The colony hybridisation and plasmid DNA dot blot assays identified a number of recombinant clones as
being representative of RNAs associated with different stages of tumour progression. These clones are
detailed in Table 1. From the normal colonic mucosa library, six clones were identified as representing
sequences of considerably reduced abundance in, or absence from, secondary tumours compared to primary
tumours or normal tissue. In addition, a group of seven clones also represented sequences of reduced
abundance in secondary tumours compared to normal tissue and primary tumours, which were assigned to a
separate group on the basis of this semi-quantitative screening. A further group of seven clones appeared to
represent sequences of increased abundance in primary tumours compared to normal tissue or secondary
tumours. No clones apparently representing secondary tumour-specific sequences were identified in the
normal colonic mucosa library.

In order to identify sequences specifically associated with metastasis, a cDNA library of approx. 3000 clones
representing the more abundant poly(A)+ RNAs of a liver metastasis from a colorectal tumour was also
screened as outlined in the screening protocol. Sequences common to liver metastases, but of altered
abundance in primary tumours, were identified by differential screening with cDNA probes derived from two
secondary and four primary colorectal tumours. Further screening of these selected clones with cDNA probes
derived from liver metastases, primary tumours, normal colonic mucosae and normal human liver identified 82
clones that represented RNA sequences in metastases which were of altered abundance in primary tumours,
but were absent from the total RNA isolated from normal human liver. Plasmid DNA was isolated from these
recombinant clones and hybridised with three different cDNAs transcribed from RNA from each histologically 
graded tissue type. 

This screening protocol identified two groups of clones in the liver metastasis library: a group of six clones
representing sequences of increased abundance in primary tumours compared to normal colonic mucosa or
secondary tumours, and a group of eight clones representing sequences of increased abundance in
secondary tumours compared to normal tissue or primary tumours (Table 1). 

On the basis of these semi-quantitative changes in hybridisation signal intensity, four recombinant clones 
from the normal colonic mucosa library and one recombinant clone from the liver metastasis library were
selected for further characterisation by Northern blot and RNA dot-blot analysis. 

Characterisation of selected recombinants 

Following restriction enzyme digestion of recombinant plasmids, the cDNA inserts in the recombinants
pNM19, pNM32, pNM41 and pNM61 from the normal colonic mucosa library, and pLM59 from the liver
metastasis library, were shown to be between 230 and 530 bp by agarose gel electrophoresis. Northern blotting
of total RNA from a specimen of normal colonic mucosa, followed by hybridisation with nick-translated
plasmids, identified these recombinants as being homologous to RNAs of between 0.8 and 2.1 kb (Fig. 1 and
Table 2). In addition, sequences homologous to the recombinant clone pLM59, which represented an RNA of
increased abundance in metastases relative to normal tissue, could be detected in total RNAs from patients
both with and without a clinical history of disseminated disease (Fig. 2). Southern blot analysis of EcoRI- and
HindIII-digested normal human white blood cell DNA (results not shown) indicated that each of these
recombinants represented unique RNAs. 

Relative abundance of selected recombinant-homologous RNAs

The relative abundances of RNA sequences homologous to the five recombinants that, on the basis of prior
semi-quantitative screening, were closely associated with metastases were determined in a series of tissue
specimens corresponding to different stages of colorectal tumour progression (Fig. 3). Four of the
recombinants, clones pNM19, pNM32, pNM41 and pNM61, were found to represent RNAs reduced 5- to
10-fold in abundance in metastases relative to primary tumours, and 10- to 14-fold in metastases relative to
normal mucosae. One recombinant, clone pLM59, represented an RNA showing a 4- to 6-fold increase in
abundance in metastases relative to primary tumours and normal mucosae (Tables 3 and 4). To confirm that
the observed differences in abundance of sequences homologous to the cloned cDNAs were due to
differences in specific hybridisation and not to errors in the estimates of the RNA content of the samples, the
same dot blots were stripped and reprobed with a cloned fragment of human 18S ribosomal DNA (results not
shown). RNA dot-blot analysis also revealed considerable variation in the abundance of RNAs homologous to
the cloned sequences at different stages of tumour progression (Table 3). That these differences were not
attributable to degradation of the RNA samples concerned was shown by hybridisation of Northern blots to
skeletal muscle actin (Shani et al. 1981) and β2-microglobulin (Suggs et al. 1981) cDNA probes (results not
shown).
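
The relative abundances reported for metastases can be recomputed from the dot-blot end-points. The following is a minimal sketch, assuming the mean reciprocal dilution end-points listed in Table 3 and taking the ratio of two end-points as the relative abundance; the function and variable names are illustrative only and form no part of the original protocol.

```python
# Mean reciprocal dilution end-points taken from Table 3
# (total RNA doubling-dilution dot-blot assay).
END_POINTS = {
    "pNM19": {"mucosa": 104, "polyp": 88, "carcinoma": 107, "metastasis": 10},
    "pNM32": {"mucosa": 320, "polyp": 192, "carcinoma": 32, "metastasis": 24},
    "pNM41": {"mucosa": 98, "polyp": 192, "carcinoma": 32, "metastasis": 9},
    "pNM61": {"mucosa": 130, "polyp": 85, "carcinoma": 32, "metastasis": 9},
    "pLM59": {"mucosa": 5, "polyp": 4, "carcinoma": 8, "metastasis": 32},
}


def relative_abundance(clone, tissue, reference):
    """Return the end-point ratio of `tissue` to `reference` for one clone.

    Assumption: the reciprocal dilution end-point is proportional to the
    abundance of the homologous RNA, so the ratio estimates fold change.
    """
    values = END_POINTS[clone]
    return values[tissue] / values[reference]


if __name__ == "__main__":
    for clone in END_POINTS:
        vs_mucosa = relative_abundance(clone, "metastasis", "mucosa")
        vs_carcinoma = relative_abundance(clone, "metastasis", "carcinoma")
        print(f"{clone}: {vs_mucosa:.2f} vs mucosa, {vs_carcinoma:.2f} vs carcinoma")
```

For pLM59 the ratios are 32/5 = 6.4 relative to mucosa and 32/8 = 4.0 relative to carcinoma, in agreement with Table 4.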

DISCUSSION 

Molecular cloning of cDNAs representing individual mRNAs from the total poly(A)+ RNA populations of 
normal and neoplastic tissue of the colon has enabled us to identify changes in the abundance of specific 
RNAs reflecting phenotypic characteristics associated with tumour progression and metastasis.

We previously screened cDNA libraries of about 1000 recombinant clones from poly(A)+ RNAs representing
clinically metastasising and non-metastasising variants of colorectal tumours (Kerr et al. 1983) and, although
quantitative RNA dot-blot hybridisation analysis identified cDNA clones corresponding to sequences of
greater abundance in RNAs from tumours compared to normal mucosae, no clones were found that
consistently distinguished between localised and disseminated disease. The present study is based upon the
random cloning of cDNAs representing the steady-state levels of total poly(A)+ RNAs from normal mucosa and
from a liver metastasis of a colorectal carcinoma. The frequency of a single cloned sequence in the cDNA
library reflects the abundance of that sequence in the original mRNA population, and is ultimately determined
by the turnover of the corresponding mRNA, which in turn may depend on the parent tissue. The majority of
sequences comprising the abundant and moderately abundant classes of mRNA from the 10 000 - 30 000
different sequences in typical eukaryotic tissues may be expected to be represented in a cDNA library of
between 5000 and 10 000 clones (Williams, 1981). Thus the libraries screened in this study should have been
large enough to contain most abundant and moderately abundant sequences, although low abundance
mRNAs will not have been well represented or protected. However, in a study of tissue-related differences in
mRNA populations, Hastie and Bishop (1976) concluded that the most striking differences between tissues
could be found among the abundant sequences. Furthermore, differences between normal and
SV40-transformed human fibroblast mRNA populations could be ascribed to a few sequences of the high
abundance mRNA classes (Williams et al. 1977).
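
The relationship between library size and representation can be illustrated numerically. The sketch below assumes that clones are drawn independently and at random from the mRNA population, so that a sequence making up a fraction f of the mRNA occurs at least once among N clones with probability 1 - (1 - f)^N. The abundance fractions are hypothetical round figures chosen for illustration and are not values measured in this study.

```python
import math


def probability_represented(fraction, n_clones):
    """Chance that a sequence of the given fractional abundance occurs
    at least once among n_clones randomly drawn clones."""
    return 1.0 - (1.0 - fraction) ** n_clones


def clones_needed(fraction, probability):
    """Smallest number of random clones that includes the sequence
    with at least the requested probability."""
    return math.ceil(math.log(1.0 - probability) / math.log(1.0 - fraction))


# Hypothetical abundance classes (fraction of total mRNA per sequence).
ASSUMED_FRACTIONS = {
    "abundant": 1e-2,
    "moderately abundant": 1e-3,
    "low abundance": 1e-5,
}

if __name__ == "__main__":
    for label, fraction in ASSUMED_FRACTIONS.items():
        for n_clones in (5000, 10000):
            p = probability_represented(fraction, n_clones)
            print(f"{label:20s} N={n_clones:6d}  P(represented)={p:.3f}")
```

Under these assumed fractions, the abundant and moderately abundant classes are almost certain to appear in a library of 5000 to 10 000 clones, whereas a low abundance sequence would usually be missed.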

By screening two cDNA libraries we have identified a number of cDNA clones homologous to RNAs of
significantly reduced or increased abundance in metastases relative to neoplastic and normal colonic tissue.
Screening those recombinants representing abundant RNAs common to normal colonic tissue specimens
identified sequences expressed at different levels in primary and secondary tumours. Similarly, screening
those recombinants representing sequences common to liver metastases of colorectal tumours identified
sequences that represented RNAs associated with colorectal neoplasia and metastasis. Although
histologically normal mucosa from tumour-bearing patients may exhibit phenotypic changes associated with
the disease (Shamsuddin et al. 1981), these differences should not have severely masked those specifically
associated with transformation or metastasis.

On the basis of colony and plasmid DNA dot hybridisations the vast majority of cDNA clones in the two
libraries were found to correspond to abundant RNA sequences shared by both normal and neoplastic colon;
most of these sequences were also present in the total poly(A)+ RNA of normal liver. Those sequences
showing some variation in abundance associated with tumour development (representing only 0.4% of the
total sequences examined) were poorly represented in the poly(A)+ RNA of normal liver, suggesting that the
changes in abundance that occur do so in sequences characteristic of the colon. Furthermore, the clones we
have identified suggest that the development of metastatic tumour cell populations in colorectal tumours is
not solely due to the aberrant expression of one or two genes, but rather to the subtle alteration of multiple
genetic loci. The results, however, do not allow any distinction to be made between changes in gene
expression that may be associated with the prior existence of metastatic tumour cell populations (Fidler et al.
1978) and those that may arise, for example, as a result of selective pressures exerted by the host
microenvironment (Schirrmacher, 1980; Kerbel et al. 1984).

Changes in gene expression are implied in tumour progression (Foulds, 1975) and the generation of clonal
diversity (Nowell, 1976) responsible for heterogeneity within tumours for a number of phenotypic
characteristics, which may include metastatic capability (Fidler and Hart, 1982). However, the precise nature of
the genetic events associated with tumour progression remains largely undetermined. A number of cellular
oncogenes have been identified as being aberrantly expressed in colorectal cancer, including c-myc
(Rothberg et al. 1985; Stewart et al. 1986), c-myb (Alitalo et al. 1984), c-Ha-ras and c-Ki-ras (Spandidos and
Kerr, 1984), but their role in colorectal tumour development is unclear. Although decreased levels of p21ras
were demonstrated in metastases, regardless of site, compared to primary colon tumours (Gallick et al. 1985),
there appears to be no correlation between the levels of ras-related cellular RNA and clinical outcome with
regard to the development of metastatic disease (Kerr et al. 1986). Amplification of c-myc may be correlated
with tumour metastasis (Yokota et al. 1986) and the transfection of cellular oncogenes (Thorgeirsson et al.
1985) has shown them to be involved in the acquisition of metastatic capability. Other, as yet uncharacterised,
sequences (Bernstein and Weinberg, 1985), and sequences such as those we have identified, may be related to
the events associated with the activation of these and other genes during the metastatic process.

These sequences may also prove to be of considerable prognostic value in predicting the metastatic 
capability of primary tumours at the time of surgery.

EXAMPLE 2 

The insert sequence of formula I (as hereinbefore defined) from pLM59 was used to screen breast DNA
samples to see if any alterations to the genomic sequences could be detected in any of these samples. To date
35 primary breast tumours, 10 lymph node metastases from primary breast tumours and 25 DNA samples from
normal individuals (placenta or lymphocyte DNA) have been screened. Whilst amplification or rearrangement
of the pLM59 genomic sequences has not been detected in any samples, an EcoRI RFLP has been detected.
This RFLP (presence of an 8.5 kb fragment) appears to be more frequent in aggressive breast tumours than in
normal samples, although the survey size, particularly of normal females, is still relatively small. Ten mother-father
pairs have been studied to find an informative pair, with one parent only having both sizes of the EcoRI
fragment. Two such pairs were identified, and in an extended study of all four grandparents plus four children
from each family, it has been shown that the RFLP is inherited in a Mendelian manner as might be expected.
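
The expectation for an informative pair can be sketched as a simple cross. The example below assumes a two-allele model in which one parent carries both EcoRI fragment sizes (heterozygous) and the other carries only one (homozygous). The allele labels "A" and "B" are hypothetical placeholders and are not designations used in this study.

```python
from collections import Counter
from itertools import product


def offspring_genotypes(parent1, parent2):
    """Return expected genotype frequencies among offspring.

    Each parent is a two-character string of allele labels, for example
    "AB" (both fragment sizes) or "BB" (one fragment size only). Every
    combination of one allele from each parent is taken as equally likely.
    """
    counts = Counter("".join(sorted(pair)) for pair in product(parent1, parent2))
    total = sum(counts.values())
    return {genotype: count / total for genotype, count in counts.items()}


if __name__ == "__main__":
    # Informative pair: only one parent shows both fragment sizes.
    for genotype, frequency in offspring_genotypes("AB", "BB").items():
        print(f"{genotype}: {frequency:.2f}")
```

Under this model, half of the children of an informative pair would be expected to carry both fragment sizes and half only one, which is the pattern against which Mendelian inheritance of the RFLP can be judged.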

Table 1. Recombinant cDNA dot blot hybridisation to cDNA probes
from histologically graded tissues

                                        Relative hybridisation to cDNA probes
Origin of               Number          representing (a)
recombinant clones      of clones (b)   Mucosa     Primary tumour    Secondary tumour

Normal colonic          6               + (c)      +                 -
mucosa                  7               +          +                 +/-
cDNA library            7               +          ++                +

Liver metastasis        8               +          +                 ++
cDNA library            6               +          ++                +

a. 32P-labelled single-stranded cDNAs reverse transcribed from
total poly(A)+ RNA.

b. Clones grouped on the basis of hybridisation of plasmid DNA dot
blots with probes indicated; identical results obtained with
probes derived from three specimens of each tissue type.

c. Hybridisation signals: ++, very strong; +, strong; -, weak or
absent.

Table 2. Characteristics of five selected cloned sequences.

cDNA clone    Size of cDNA     Size of homologous
              insert (bp)      RNA (kb)

pNM19         530              1.2
pNM32         485              1.2
pNM41         420              1.9
pNM61         230              2.1
pLM59         400              0.8

Table 3. Relative abundances of five mRNAs in mucosae, polyps,
carcinomas and metastases.

                     Recombinant clones
Tissues              pNM19      pNM32    pNM41    pNM61    pLM59

Mucosa (4) (a)       104 (b)    320      98       130      5
Polyp (3)            88         192      192      85       4
Carcinoma (4)        107        32       32       32       8
Metastases (2)       10         24       9        9        32

a. Number of individual samples.

b. Mean values of reciprocals of dilution end-points
determined by total RNA doubling-dilution dot-blot assay (see
Fig. 4).

Table 4. Abundances of homologous RNAs in metastases relative
to mucosa and carcinoma.

Abundance in             Recombinant clone
metastases               pNM19    pNM32    pNM41    pNM61    pLM59

Relative to mucosa       0.1      0.08     0.09     0.07     6.4
Relative to carcinoma    0.1      0.75     0.28     0.28     4.0



Claims 



1. A polynucleotide sequence which is differentially expressed during malignant tumour progression and
metastasis in colorectal cancer and fragments thereof. 

2. A polynucleotide sequence as claimed in claim 1 characterised in that the cDNA sequence 
corresponding thereto contains the sequence of formula I as herein defined or a sequence 
complementary thereto.

3. A polynucleotide sequence as claimed in claim 1 characterised in that a cDNA sequence derived 
from said polynucleotide sequences comprises the insert sequence in pLM59 (NCIB No. 12429). 

4. A process for preparing a polynucleotide sequence as defined in any one of the preceding claims 
which comprises the use as a probe of the insert sequence in pLM59 or the polynucleotide sequence of 
formula I as defined in claim 2 or a fragment of such sequences having at least 8 consecutive nucleotides,
to obtain the corresponding genomic DNA or RNA polynucleotide sequences as defined in any one of the 
preceding claims. 

5. A process for preparing a cDNA polynucleotide sequence as defined in any one of claims 1 to 3 which
comprises synthesising the cDNA sequence from an RNA polynucleotide sequence as defined in any one
of claims 1 to 3 by enzymatic techniques known per se.

6. A process for preparing a polynucleotide sequence as defined in any one of claims 1 to 3 by chemical 
or enzymatic synthesis or by microbiological reproduction. 

7. A polypeptide or fragment thereof, encoded by a polynucleotide sequence as defined in any one of 
claims 1 to 3. 

8. An antibody effective to bind at least a fragment of the polypeptide as claimed in claim 7.

9. A polynucleotide probe which comprises a nucleotide sequence capable of hybridising to a 
polynucleotide sequence which is differentially expressed during malignant tumour progression and 
metastasis in colorectal cancer and fragments thereof. 

10. A polynucleotide probe as claimed in claim 9 wherein the nucleotide sequence is capable of 
hybridising to a polynucleotide sequence as defined in claim 2 or claim 3 or a fragment thereof.

11. A method for the diagnosis or prognosis of malignant disease which comprises detecting the 
presence or absence in a sample of a polynucleotide or fragment thereof as defined in any one of claims 1
to 3, a polypeptide or fragment thereof as defined in claim 7 or an antibody as defined in claim 8. 

12. A method of determining the presence or absence of a predisposition to malignant disease which 
comprises the use of a polynucleotide or fragment thereof as defined in any one of claims 1 to 3 in
detecting the presence or absence of polymorphisms associated with the predisposition to malignant 
disease. 

13. A method of determining the presence or absence of a predisposition to malignant disease which 
comprises detecting the presence or absence of a mutation within the polynucleotide sequence defined
in any one of claims 1 to 3.